Processing of multiple media streams

ABSTRACT

A system includes a media studio including a processor and a memory. The memory stores instructions executable by the processor. The processor is programmed to receive data indicating a type of event to be recorded, receive one or more media input items from one or more respective media devices, and generate a media output item based at least in part on the one or more received media input items, and further based at least in part on the data indicating the type of event to be recorded and one or more respective locations of the one or more respective media devices.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 15/367,745, filed Dec. 2, 2016, entitled “Processing of Multiple Media Streams,” which application claims priority to and all benefits of U.S. Provisional Patent Application No. 62/263,080, filed Dec. 4, 2015, the disclosures of which are incorporated by reference in their entirety for all purposes.

BACKGROUND

The use of streamed media data such as video data has increased rapidly. Videos are used for instructional purposes, to record and view sports events, to share personal and family events, to sell real estate, etc. Security cameras, traffic cams, etc. provide streams of media data that may be collected and monitored at a single location. Drones equipped with video cameras provide yet another platform for the generation and streaming of videos. Users may wish to mix data from multiple input data streams into a single output data stream to view immediately or store for later viewing.

DRAWINGS

FIG. 1 is a block diagram of an exemplary media processing system for capturing and mixing multiple media streams into an output media stream.

FIG. 2 is a block diagram of an exemplary camera for the media processing system of FIG. 1.

FIG. 3 is a block diagram of an exemplary operator console for the media processing system of FIG. 1.

FIG. 4 is a block diagram of an exemplary media studio for the media processing system of FIG. 1.

FIG. 5 is a diagram of a tennis court illustrating exemplary placement and coverage ranges of cameras for recording a tennis match.

FIG. 6A is a diagram of a tennis court including an exemplary coordinate system.

FIG. 6B is a diagram of the exemplary tennis court of FIG. 6A illustrating an exemplary track of the sun.

FIG. 6C is a diagram of the exemplary tennis court of FIG. 6B illustrating a preferred and a nonpreferred range for placement of a first camera.

FIG. 6D is a diagram of the exemplary tennis court of FIG. 6C including player benches and indicating a preferred location for placement of a second and a third camera.

DETAILED DESCRIPTION

System Overview

A media processing system including a media studio allows the user to bring the studio along to live events. The media studio allows for preview, e.g., simultaneous preview, of multiple media input items (referred to herein as an aggregated media item), generating a media output item based on the multiple media input items, sharing the media output item, and storing the media output item for future viewing, all via a remote operator console. The media studio may be programmed to generate recommendations for the media output item and/or generate the media output item directly based on predetermined rules.

In addition to media input items, the media studio may receive data from data collectors, e.g., one or more sensors mounted on or in a media device such as a camera providing a media input, included with another device carried by a user of the camera, etc. The data may be received directly from the data collectors, or indirectly, for example via metadata associated with the media input items. The selection of media input items for inclusion in the media output item may be based in part on the data from the data collectors.

A variety of types of data may be used by the media studio. For example, the data may include biometric data (e.g., heart rate, blood pressure, etc.) associated with a user of a camera and which may be used, e.g., to determine an excitement level of the user. As another example, the data may be data indicative of the quality of a media input item such as steadiness of a camera, contrast of an image, etc. Still further, the data may include location data or movement data associated with an object or participant in the event to be recorded (e.g., a player in a sports event, a ball being used in a sports event, etc.). Still further, the data may include global positioning data, weather data, light conditions, etc. related to the recording environment. Other types of data, collected from sensors, or, e.g., from other computing devices, may also be used for selecting the content for and/or generating the media output item.

The predetermined rules for generating the media output items may include making decisions based on the input from the one or more data collectors. For example, in order to be included in the media output item, the media input item may need to meet quality criteria such as having a picture contrast within a predetermined range, or being sufficiently steady (shaking level below a predetermined threshold). The use of data from data collectors for determining a media output item will be discussed in detail below.

Referring to FIG. 1, a media processing system 10, including a mobile media studio 12, can receive media input data from one or more media devices 16. The media input data may be received via one or more wired and/or wireless networks 14 and one or more wired and/or wireless direct connections. Receiving media data may include receiving media data sent from a media device 16, e.g., a camera 20, or retrieving data, e.g., from a storage device 23. Additionally, the media studio 12 may receive data from one or more data collectors 17. The data collectors 17 may include a variety of sensors including biometric sensors such as heart rate monitors, blood pressure monitors, etc.; movement sensors such as accelerometers, gyroscopes, etc.; location sensors such as global positioning systems, RFID tags, etc.; environmental sensors such as barometers, thermometers, light sensors, etc.; and other types of sensors which may provide data related to a recording event. The data collectors 17 may further include systems such as global positioning systems (GPS), weather tracking systems, etc.

As used herein, the recording event may refer to the actual event being recorded, for example, a football game or company picnic, including the environment, participants, camera 20 operators, media studio 12 operators, operator console 18 operators, viewers, audience, etc. related to the event being recorded. Further, the data collectors 17 may provide data, or may be included in one or more of the media devices 16, and provide data via, for example, a computing device in the media device 16.

The media studio 12 can be controlled by, and receive media input data from, an operator console 18, which may be remote to the media studio 12. The media studio 12 further can generate media output data and provide the media output data to media devices 16, e.g., a viewer 22. The system 10 allows a user to produce media output data at a location where the recording (or a portion of the recording) is taking place. Controlling the media studio 12 via a remote operator console 18 provides the user the freedom to move about the recording location while maintaining oversight and control over the production process.

Exemplary System Elements

Referring now to FIG. 1 in more detail, the media processing system 10 includes a media studio 12, one or more media devices 16, and one or more data collectors 17. The one or more media devices 16 include the operator console 18, and may further include one or more cameras 20, one or more viewers 22, and one or more media storage devices 23. The media devices 16 and the data collectors 17 may be remote or local to the media studio 12 and may be coupled to the media studio 12 via at least one of the network 14 and a direct wired or wireless connection 82.

A server 28 may also be communicatively coupled to the media studio 12, the media devices 16, and the data collectors 17 via the network 14. Additionally, the media processing system 10 may include a monitor 30 communicatively coupled to the media studio 12.

The media studio 12 generally can receive multiple media input items substantially simultaneously, and can combine them into an aggregated media item for viewing via the operator console 18. An “aggregated media item,” as that term is used herein, is a set of two or more of the media input items, arranged to be displayed substantially simultaneously on a user display, such as a touchscreen, a screen with projected or overlaid images, etc. The media input items may be arranged such that they appear side by side, in rows, in a picture-in-picture format, etc. within the user display. In addition, the aggregated media item may include, for example, a graphical user interface which is displayed on the user display and accepts user inputs. As described below, the media studio 12 may compress the aggregated media item prior to providing it to the operator console 18.

The media input items may be visual and/or audio data such as videos captured by a video camera, sounds captured by a microphone, scenes from a security camera, etc. The microphone may be integrated in a media device 16, or may be a standalone media device 16 which can independently communicate with the media studio 12. The media input items may be, e.g., streamed data or static data such as single digital photographs. The media studio 12 further can receive commands from the operator console 18, and can generate a media output item according to the received commands. The media output item may include data from one or more of the media input items.

As an example, the media studio 12 may receive four media input items. The media studio 12 may generate an aggregated media item including each of the four media input items, and transmit the aggregated media item to the operator console 18. A user of the operator console 18 may select, via a user interface, one of the four views in the aggregated media item to be included in the media output item. Based on a command received from the operator console 18, the media studio 12 may generate the media output item that includes the selected media input item. The media output item may be transmitted via a network 14 to be shared with viewers, or stored in a storage device 23.

In addition to selecting one or more media input items to be included in the media output item, the media studio 12 may perform various media processing operations. The media processing operations may be performed based on commands received from the operator console 18. A nonlimiting list of example processing operations that may be performed by the media studio 12 includes scaling, mixing, morphing, compositing, adding overlays (audio and/or video), etc. In addition, the media studio 12 may perform operations such as object tracking, image stabilization, etc. The operation of the media studio 12 will be discussed in greater detail below.

The media studio 12 may further be programmed to recommend, e.g., to the operator console 18, media input items to be included in a media output item and/or to generate a media output item based on one or more predetermined rules. The predetermined rules may be, e.g., input by a user or learned by the media studio 12 based on previous operator choices. For example, the media studio 12 processor 62 may be programmed to learn user preferences based on previous choices and may include a hardware learning mechanism such as a neural network, etc. Examples of rule-based selection of media input items for inclusion in a media output item are discussed below.

Communications between the media studio 12 and the media devices 16 and data collectors 17 (collectively, networked devices 16, 17) may occur via the network 14 and via direct connections 82, e.g., wired and/or wireless (e.g., Bluetooth, IEEE 802.11, etc.) connections such as are known. In general, the network 14 represents one or more mechanisms for delivering media content between the media studio 12 and the networked devices 16, 17. Accordingly, the network 14 may be one or more of various wired or wireless communication mechanisms, including any desired combination of wired (e.g., cable and fiber) and/or wireless (e.g., cellular, wireless, satellite, microwave, and radio frequency) communication mechanisms and any desired network topology (or topologies when multiple communication mechanisms are utilized). Exemplary communication networks include wireless communication networks, local area networks (LAN) 24 such as a WiFi network or Ethernet, and/or wide area networks (WAN) 26 such as the Internet, etc.

In addition to the one or more networks 14, one or more wired or wireless direct connections may be used to connect the media studio 12 to the media devices 16. Direct connections may include, e.g., Bluetooth, Universal Serial Bus (USB), high-definition multimedia interfaces (HDMI), custom serial interfaces, etc.

For example, one or more high-definition multimedia interfaces (HDMI) may be used to transfer data between a media device 16 and the media studio 12, or from the media studio 12 to a computer or television monitor 30. The HDMI is a well-known proprietary audio/video interface for transferring uncompressed video data and compressed or uncompressed digital audio data from an HDMI-compliant source device such as the media device 16 to a digital media processing device such as the media studio 12 or to the compatible computer monitor, e.g., a monitor 30.

The server 28 may be communicatively coupled to the media studio 12, the media devices 16, and the data collectors 17 via the network 14. The server 28 may include a communications circuit for communicating via the network 14, and may further include a memory and one or more processors configured to perform programs, i.e., sets of computer-executable instructions, stored in the memory. The server 28 may, e.g., receive media output items and store the media output items for future use.

Media content, such as the media input items, media output items, and/or aggregated media items, is generally delivered via the network 14 in a digital format, e.g., as compressed audio and/or video data, and may include media data and metadata. For example, MPEG refers to a set of standards generally promulgated by the International Standards Organization/International Electrotechnical Commission Moving Picture Experts Group (MPEG). H.264 refers to a standard promulgated by the International Telecommunications Union (ITU). Accordingly, by way of example and not limitation, media content may be provided in a format such as the MPEG-1, MPEG-2, or the H.264/MPEG-4 Advanced Video Coding standards (AVC) (H.264 and MPEG-4 at present being consistent), or according to some other standard or standards. For example, media content could be audio data formatted according to standards such as MPEG-2 Audio Layer III (MP3), Advanced Audio Coding (AAC), etc. Further, the foregoing standards generally provide for including metadata.

Media devices 16 include the operator console 18 and may include one or more cameras 20, one or more viewers 22, and one or more storage devices 23. The operator console 18 may be used to control the operation of the media studio 12, and in some cases, may also perform as a media input device 16 such as a camera 20, as described below. Media devices 16 may further include data collectors 17, for example gyroscopes, accelerometers, etc., which may provide data related to the media device 16 such as movement, location, etc.

A block diagram of an exemplary camera 20 is shown in FIG. 2. The camera 20 may include a memory 50 and a processor 51, the memory 50 storing program code, i.e., computer-executable instructions, executable by the processor 51. The memory 50 may include video buffers which may be used for replays, applying video and audio filters, compressing and decompressing media data, etc. The processor 51 may be communicatively coupled to a user interface 52, a media receiver 54, a communications circuit 56, and data collectors 17. The camera 20 captures media data, i.e., visual and sound data such as photographs and videos, and transmits the media data, e.g., via the network 14, to the media studio 12. Examples of a camera 20 include portable devices such as smartphones, tablets, laptops, digital cameras, etc. Examples of cameras 20 may further include security cameras, traffic cams, cameras transported by airborne drones, etc.

The media receiver 54 includes one or more data receiving elements for receiving media data. The collected media data may include visual data and audio data. The media receiver 54 may include, e.g., microphones for receiving sound data and CMOS or CCD image sensors for receiving image data.

The user interface 52 is communicatively coupled to the processor 51 and may include one or more input devices such as a microphone, buttons, a touchscreen display, a mouse, a keyboard, a gesture-recognition device, switches, etc., for receiving input from the user. The user interface 52 may further include one or more output devices such as a display, lamps, speakers, etc., for communicating information to the user.

The data collectors 17 may be used to determine, e.g., operating conditions of the camera 20, and may include accelerometers, gyroscopes, light meters, etc. The data collectors 17 may be used to measure, for example, movement of the camera 20 (shaking, tracking of an object), the direction the camera 20 is pointing, the light conditions under which the camera 20 is operating, etc. The data collectors 17 may provide data to the processor 51, which may, e.g., send the data to the media studio 12 for additional processing.

For each quality parameter (steadiness, contrast ratio, etc.), a user can set operational intervals. Operational intervals are defined as ranges of parameters related to a media input item which indicate that the media item is suitable for further processing. For example, if a steadiness of a camera 20 is not within an operational interval, the media input item associated with the camera 20 may not be considered by the media studio 12 for additional processing. The operational interval in that example may be the amplitude of shaking that is acceptable. If all quality parameters are within their respective operational intervals, an overall quality parameter Q is calculated for each camera 20 and/or associated media input item:

$Q_i = C_i \sum_{k=1}^{n} CQ_k \cdot PQ_k$

in which i is an index corresponding to the ith camera 20, C_i is a coefficient corresponding to the ith camera 20 representing quality characteristics (e.g., a professional camera 20 will have a higher coefficient C_i than a camera 20 embedded in a mobile phone), k is an index of types of picture-quality measurements (e.g., k=1 is resolution, k=2 is brightness, k=3 is contrast ratio, . . . k=n is steadiness), CQ_k is a weighting factor for the relative importance of the kth type of picture-quality measurement, and PQ_k is a scaled value for the kth type of picture-quality measurement. The weighting factors CQ_k may be set by a system designer or by a user. The values PQ_k may be scaled to make different types of picture-quality measurements k commensurable.
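
By way of illustration only, the following sketch shows one way the interval gating and weighted sum above could be implemented; the names (OPERATIONAL_INTERVALS, WEIGHTS, quality_score) and all numeric values are hypothetical, not taken from this disclosure:

    OPERATIONAL_INTERVALS = {          # acceptable range per gated parameter
        "steadiness": (0.0, 0.3),      # e.g., shake amplitude, arbitrary units
        "contrast_ratio": (0.4, 1.0),
    }

    WEIGHTS = {                        # CQ_k, relative importance per type k
        "resolution": 0.2,
        "brightness": 0.2,
        "contrast_ratio": 0.3,
        "steadiness": 0.3,
    }

    def quality_score(camera_coeff, measurements):
        """Return Q_i = C_i * sum_k(CQ_k * PQ_k), or None when a gated
        parameter falls outside its operational interval."""
        for name, (low, high) in OPERATIONAL_INTERVALS.items():
            if not low <= measurements[name] <= high:
                return None   # media input item excluded from further processing
        return camera_coeff * sum(
            WEIGHTS[k] * pq for k, pq in measurements.items()
        )

    # e.g., a professional camera (C_i = 1.5) vs. a phone camera (C_i = 1.0)
    readings = {"resolution": 0.9, "brightness": 0.8,
                "contrast_ratio": 0.7, "steadiness": 0.25}
    print(quality_score(1.5, readings), quality_score(1.0, readings))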

The camera 20 may receive and transmit the media data, e.g., via the network 14, or via the direct connection 82, to the media studio 12. The received media data may be, as one example, full high-definition (Full HD) data, providing an image resolution of 1920 by 1080 pixels. Data formats with other resolutions may also be used.

The camera 20 may further receive data from, e.g., the media studio 12, and provide data, e.g., via the user interface 52, to the user of the camera 20. For example, as discussed below, the media studio 12 may determine, based on data received from the camera 20, that the camera 20 is shaking too much, e.g., above a predetermined threshold, or that the contrast of the media input item received from the camera 20 is too low, e.g., below a predetermined threshold. The media studio 12 may send this data to the camera 20, which may display the information on the user interface 52.

The viewer 22 may be used to display media output data received from the media studio 12, and may include a display such as a liquid crystal display (LCD) or plasma display. The media data may be received, for example, via the network 14 or via the direct connection 82. Examples of the viewer 22 include mobile devices such as mobile phones, tablets, and laptops, and may further include devices such as digital televisions. The viewer 22 may receive, e.g., Full HD data, providing a resolution of 1920 by 1080 pixels. Data formats with other resolutions may also be used.

The storage device 23 may store media data and provide an interface to allow the media studio 12 to access the data via the network 14 or via the direct connection 82. The media storage device 23 may include one or more types of data storage such as read only memory (ROM), random access memory (RAM), flash memory, electrically programmable memory (EPROM), electrically programmable and erasable memory (EEPROM), embedded MultiMediaCard (eMMC), a hard drive, etc. Further, the media storage device 23 may include a processor, programmed, for example, to receive commands from the media studio 12. The processor may be further programmed, based on the commands, to retrieve media data items from data storage and send the media data items to the media studio 12.

Communications between the storage device 23 and the media studio 12 may be performed via the network 14. Additionally or alternatively, communications may be performed via the direct connection 82. For example, the storage device 23 may be connected to the media studio 12 via a Universal Serial Bus (USB) port, or other wired or wireless interface.

The operator console 18 may be used to control the operation of the media studio 12. As shown in FIG. 3, the operator console 18 may include a memory 40 and a processor 41, the memory 40 storing program code, i.e., computer-executable instructions, executable by the processor 41. The processor 41 may be communicatively coupled to a user interface 42, a media receiver 44, a communications circuit 46, and data collectors 17.

The user interface 42 is communicatively coupled to the processor 41 and may include one or more input devices such as a microphone, buttons, a touchscreen display, a mouse, a keyboard, a gesture-recognition device, switches, etc., for receiving input from the user. The user interface 42 may further include one or more output devices such as a display, lamps, speakers, etc., for communicating information to the user. All, or a portion of, the user interface 42 may be physically separate from the operator console 18. For example, the operator console 18 may be a tablet computer which projects its output to another screen (e.g., via AirPlay), while the operator continues to control the media studio 12 from the tablet computer.

The media receiver 44 may be, for example, a digital camera as is known that may receive media data. The media receiver 44 may include, e.g., a CMOS or CCD image sensor for receiving visual data and a microphone for receiving audio data. The media data may include visual data such as still photographs and video recordings and may further include audio data such as a sound recording or soundtrack. The media receiver 44 may, e.g., output the media data to the processor 41.

The communications circuit 46 is communicatively coupled to the processor 41 and is configured to communicate with the media studio 12 via, e.g., the network 14 and/or through the direct connections 82.

The communications circuit 46 may include a radio frequency (RF) transceiver for WiFi communications (typically 2.4 GHz or 5 GHz bands). The RF transceiver may communicate, for example, directly with an RF receiver included in the media studio 12. Additionally or alternatively, the communications circuit 46 may include, e.g., an Ethernet interface, a Universal Serial Bus (USB) interface, a Bluetooth transceiver, a high-definition multimedia interface (HDMI), etc.

Alternatively, the communications circuit 46 may communicate with the media studio 12 indirectly, i.e., via an intermediate device. For example, the communications circuit 46 may communicate with another hotspot, i.e., a communications circuit including a router and client providing a connection to a wide area network (WAN) 26 such as the Internet. The media studio 12 may receive the communications via the WAN 26.

The data collectors 17 may be used to determine, e.g., operating conditions of the operator console 18, and may include accelerometers, gyroscopes, light meters, etc. The data collectors 17 may be used to measure, for example, movement of the operator console 18 (shaking, tracking of an object), the direction the operator console 18 is pointing (for example, the direction the media receiver 44 is pointing), the light conditions under which the operator console 18 is operating, etc. The data collectors 17 may provide data to the processor 41, which may, e.g., send the data to the media studio 12 for additional processing. The data sent to the media studio 12 may be raw data, i.e., representative of data coming directly from sensors. Additionally or alternatively, the processor 41 of the operator console 18 may perform additional processing of the data. For example, the processor 41 may determine values such as an excitement level, a quality level, etc. of the data and provide the determined values to the media studio 12. The data may be dynamic data which indicates the determined values as a function of time. Further, the operator console 18 processor 41 may, e.g., recognize objects within the media input item, perform audio filtering, and perform other media processing operations, and provide the results of these operations to the media studio 12.

The processor 41 is communicatively coupled to each of the user interface 42, the media receiver 44, the communications circuit 46, and the data collectors 17.

The processor 41 is programmed to control the operation of the media studio 12 based on inputs received from a user via the user interface 42. More specifically, the processor 41 is programmed to receive a media content item, e.g., an aggregated media item including one or more views from one or more cameras 20, and to display the aggregated media item via the user interface 42. The processor 41 is further programmed to receive input from the user via the user interface 42. For example, the user may view the aggregated media item and select one of the views to be included in a media output item generated by the media studio 12. The processor 41 may send a command to the media studio 12 to include the selected view in the media output item.

In addition to commands related to selecting media input items for display in the media output item, commands from the operator console 18 may include instructions to perform operations such as scaling, mixing, morphing, compositing, adding overlays, etc. Further, commands from the operator console 18 may include instructions to perform operations such as object tracking, image stabilization, etc.

An exemplary media studio 12 is shown in FIG. 4. The media studio 12 includes an internal memory 60 and a processor 62, the memory 60 storing program code, i.e., computer-executable instructions, executable by the processor 62. The processor 62 is communicatively coupled to a user interface 64, a network interface 66, an auxiliary interface 68, and data collectors 17.

The internal memory 60 may be, e.g., read only memory (ROM), random access memory (RAM), flash memory, electrically programmable memory (EPROM), electrically programmable and erasable memory (EEPROM), embedded MultiMediaCard (eMMC), a hard drive, etc., and may be used to store programs executable by the processor 62, as well as to store, for example, data representing inputs from the user, instructions received from the operator console 18, media data received from a remote media device 16, media metadata, data collected by data collectors 17, etc.

The user interface 64 is communicatively coupled to the processor 62 and may include one or more output devices such as a display, lamps, speakers, etc., for communicating information to the user. The user interface 64 may further include one or more input devices such as buttons, a microphone, a touchscreen display, a mouse, a keyboard, a gesture-recognition device, switches, etc., for receiving input from the user.

The network interface 66 includes one or more interfaces to the network 14. For example, the network interface 66 may include a hotspot, such as is known, for WiFi communications. The hotspot may include a router. The router may include a radio frequency (RF) transceiver for WiFi communications (typically 2.4 GHz or 5 GHz bands) and may receive multiple transmissions substantially simultaneously. The router may connect media devices 16 with the processor 62 of the media studio 12.

Additionally, the network interface 66 may include a link to an Internet Service Provider (ISP). The link is a mechanism for connecting to and communicating with the Internet Service Provider, e.g., satellite communications or a cable network. The link may include a transceiver and antenna for satellite communications, e.g., in the Ka band (18.3-30 GHz). The link to the ISP may receive, via the network 14, Internet protocol (IP) communications from, e.g., media devices 16 and data collectors 17.

In addition to connecting the media devices 16 with the processor 62 of the media studio 12, the router and the Internet client may be used in combination to provide Internet access for media devices 16.

The auxiliary interface 68 may include one or more wired or wireless interface circuits which may be used, for example, to connect to one or more media devices 16. The media devices 16 may include, e.g., one or more storage devices 23. The auxiliary interface 68 may include a universal serial bus (USB) interface circuit to communicate with external USB devices, for example, a memory stick or memory back-up device. As another example, the auxiliary interface 68 may include a MicroSD interface, as is known, to store data on and retrieve data from a MicroSD data card. Further, the auxiliary interface 68 may include, e.g., a Bluetooth interface for wireless connection to a media device 16. The auxiliary interface 68 may also be used to connect to data collectors 17.

Additionally, the media studio 12 may include a high-definition multimedia interface (HDMI) for connecting to a media device 16, such as a camera 20, monitor 30 (as shown in FIG. 1), etc.

The media studio 12 processor 62 is generally programmed to receive one or more media input items from one or more media devices 16. The media studio 12 processor 62 may, in some cases, generate an aggregated media item. The aggregated media item may include, for example, a picture-in-picture (PIP) display, wherein two or more of the media input items are displayed simultaneously, e.g., side by side, within the aggregated media item. The media studio 12 may transmit the aggregated media item via the network 14 to the operator console 18.

The media studio 12 processor 62 is further programmed to receive commands from the operator console 18. Based on the commands, the media studio 12 may generate a media output item. The processor 62 may select data from one or more of the media input items to include in the media output item. The media studio 12 may further, based on commands received from the operator console 18, perform various media processing operations such as scaling, mixing, morphing, compositing, adding overlays, tracking of specific people or objects, smart tagging, etc., related to the generation of the media output item.

In addition, the media studio 12 may perform media processing operations based on predetermined rules for generating the media output item. Examples of rules are discussed below.

The media studio 12 processor 62 may output the media output item, e.g., to viewers 22, to the operator console 18, and/or to other display devices. Additionally or alternatively, the media studio 12 may output the media output item, e.g., to a server 28, or to storage devices 23, where the media output item may be stored for future use. Various processes which may be performed by the media studio 12 processor 62 are described below.

Processes

Generating an Output Media Item from One or More Input Media Items

Generally, the media studio 12 processor 62 is programmed to receive one or more media input items from one or more respective media devices 16. The media input items may be received, e.g., via the network 14 or via the direct connection 82 as described above. The media input items may be compressed data or uncompressed data. Examples of compression standards that may be used for compressed data include H.264/MPEG-4, MPEG-2, MPEG-1, and Advanced Audio Coding (AAC). Additionally, various resolution formats may be used for the data. For example, the data may be Full High Definition data providing an image resolution of 1920 by 1080 pixels. Other resolutions may also be processed by the media studio 12.

Upon receiving the media input items, the processor 62 may decompress the media input items to prepare them for further processing. In order to compensate for latency between the multiple media input items, the processor 62 may additionally be programmed to time-synchronize the media input items. The time synchronization may be done, for example, during decompression.
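
As a minimal illustration of such time synchronization, the sketch below merges already-ordered frame buffers by timestamp; the Frame type and the assumption of a common capture clock are illustrative only, not part of this disclosure:

    import heapq
    from dataclasses import dataclass, field

    @dataclass(order=True)
    class Frame:
        timestamp: float                        # seconds on a common clock
        stream_id: int = field(compare=False)
        payload: bytes = field(compare=False, default=b"")

    def synchronize(streams):
        """Merge per-stream frame lists, each already in time order, into
        one globally time-ordered sequence."""
        return list(heapq.merge(*streams))

    cam1 = [Frame(0.00, 1), Frame(0.08, 1)]
    cam2 = [Frame(0.04, 2), Frame(0.12, 2)]
    for frame in synchronize([cam1, cam2]):
        print(frame.timestamp, frame.stream_id)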

Based on the media input items, the processor 62 may be programmed to generate an aggregated media item and transmit the aggregated media item to the operator console 18. The aggregated media item may include one or more of the media input items, arranged to be displayed substantially simultaneously on a user display, such as a touchscreen. The media input items may be arranged such that they appear side by side, in rows, in a picture-in-picture format, etc. within the display. In addition, the aggregated media item may include, e.g., a graphical user interface which is displayed on the user display and accepts user input. The processor 62 may compress the aggregated media item and may then transmit the aggregated media item to the operator console 18 via the network 14 or the direct connection 82.

The processor 62 may further be programmed to generate a media output item based on the media input items and commands received from the operator console 18. The processor 62 may receive commands from the operator console 18 to select one or more media input items to include in the media output item. Additionally, the processor 62 may receive commands to perform video and audio processing operations such as switching, mixing, compositing, scaling, morphing, etc.

Upon generating the media output item, the processor 62 may transmit the media output item to one or more media devices 16 for viewing. The processor 62 may additionally store the media output item for future use. Prior to transmission and/or storage of the media output item, the processor 62 may additionally compress and/or encrypt the data.

In addition to receiving commands from the operator console 18, the processor 62 may further be programmed to perform video processing operations based on one or more predetermined rules, as described below.

Selecting Input Data for Inclusion in the Media Output Item Based on an Excitement Level

The media studio 12 processor 62 may select a media input item for inclusion in the media output item based on an excitement level associated with the media input item. The excitement level may be determined based on one or more types of data received from data collectors 17.

The data may include, for example, biometric sensor data of the user of the media device 16 which is generating the media input item. For example, one or more data collectors 17 may be associated with a user of a media device 16. The data collectors 17 may measure, for example, the pulse rate of the user, the blood pressure of the user, the pupil dilation of the user, etc., and send the biometric data to the media studio 12 processor 62. The processor 62 may determine, based on the biometric data, for example, that the pulse and/or blood pressure of the user is elevated, and that the user is excited.

Excitement values may be collected from the data collectors 17 associated with particular cameras 20 (e.g., included in a camera 20 or attached to a camera 20 user). An overall excitement parameter E_i is calculated for each data collector 17 (associated with, e.g., a camera 20, a position on the recording field, a position near some part of the recording field, etc.):

$E_i = D_i \sum_{k=1}^{n} DQ_k \cdot EQ_k$

in which i is an index corresponding to the ith data collector 17, D_i is an overall correction coefficient corresponding to the ith data collector 17, k is an index corresponding to a kth type of excitement measurement (e.g., k=1 is blood pressure, k=2 is pulse rate, k=3 is pupil dilation, . . . k=n is hand speed to represent cheering or clapping), DQ_k is a specific correction coefficient corresponding to the kth type of excitement measurement, and EQ_k is a scaled value for the kth type of excitement measurement. The overall correction coefficient D_i represents how much the data from the ith data collector 17 should be adjusted to produce an expected excitement value that corresponds to a real excitement value (e.g., a historically calculated value, when, for example, after the event, one or more experts determine a scale for evaluating the excitement level). The specific correction coefficient DQ_k may be used to account for the individual nature of excitement parameters such as heartbeat rate, blood pressure, etc. For example, the overall correction coefficient D_i and the specific correction coefficient DQ_k may be determined by measuring the biometric data of the user in three conditions: a first condition when the user is at rest in a low-stimulation environment, a second condition when Serena Williams appears on the tennis court, and a third condition when an ace is served, and comparing these values with generic values for similar measurements of the general population. The processor 62 of the media studio 12 (or another computing device) may determine the specific correction coefficient DQ_k for each type of excitement measurement k based on the measurements under the three conditions. Additionally or alternatively, other data, e.g., historical data, may be used. EQ_k may be scaled to make different types of excitement measurements k commensurable.
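
The following sketch illustrates the excitement computation and a simple three-condition calibration consistent with the example above; the function names, readings, and coefficient values are hypothetical assumptions for illustration:

    def excitement_score(d_coeff, dq, eq):
        """E_i = D_i * sum_k(DQ_k * EQ_k) over excitement measurements k."""
        return d_coeff * sum(dq[k] * eq[k] for k in eq)

    def specific_correction(user_samples, population_samples):
        """Estimate DQ_k by comparing one user's readings under the three
        conditions (at rest, star player appears, ace served) with generic
        population values for the same conditions."""
        return {
            k: sum(population_samples[k]) / sum(user_samples[k])
            for k in user_samples
        }

    # Example: readings under the three conditions (illustrative numbers)
    user = {"pulse": [60.0, 95.0, 120.0], "pupil_dilation": [3.0, 4.5, 6.0]}
    population = {"pulse": [65.0, 90.0, 110.0], "pupil_dilation": [3.2, 4.0, 5.5]}
    dq = specific_correction(user, population)
    eq = {"pulse": 0.8, "pupil_dilation": 0.9}   # scaled current readings EQ_k
    print(excitement_score(1.0, dq, eq))         # D_i = 1.0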

As another example, the excitement measurements k may further include movement data associated with the media device 16. For example, the media device 16 may be moving rapidly in order to follow a play (or a player) during a sports event.

As yet another example, the excitement measurements k may include cheering or screaming, for example, by the audience, received by a microphone associated with the media device 16. For example, a media device 16 may be recording a section of a course associated with a bicycle race. When the bicycles approach that section of the course, the audience may start to cheer. The processor 62 may receive data including the cheering and determine, e.g., according to sound volume and/or other known sound recognition techniques, that the race has become exciting in the area of the course being recorded by the media device 16.

The processor 62 may further be programmed to distinguish between different types of excitement. For example, using known sound recognition techniques, the processor 62 may be programmed to identify the crying of a baby. The processor 62 may filter out the crying. Alternatively, the processor 62 may, based on the crying, select a different media input item that does not include the crying for display in the media output item.

As another example, the processor 62 may identify an elevated excitement level based on an image included in media data. For example, using known image analysis techniques, the processor 62 may detect rapid movement within the image, such as a ball or player passing quickly through the field of view. As another example, the media device 16 may be recording the audience, and the processor 62 may detect when a large number of people stand up, or may, based on known facial recognition techniques, detect excited expressions on the faces of the audience.

The media studio 12 processor 62 may, based on one or more of these excitement measurements k, select a particular media input item for display in the media output item. In the case of an active user operating the operator console 18, the processor 62 may recommend a particular media input item for selection by the user. For example, the processor 62 may highlight the media input item within the aggregated media item sent to the operator console 18.

Excluding Blank or Accidental Recordings from the Media Output Item

The media studio 12 processor 62 may be programmed to identify media input items that are blank or appear to be accidentally recorded. For example, a camera operator may forget that a camera 20 is recording, and point an image capture device towards the ground, sun, etc. The processor 62 may receive, for example, data from data collectors 17 on the camera 20 indicating that the camera 20 is pointing toward the ground. Additionally or alternatively, the processor 62 may recognize an image of the ground, determine that the image of the ground is not related to the event being recorded, and exclude it from the output media item.

As another example, an operator may forget to turn a camera 20 off, and put the camera in the operator's pocket. A data collector 17 on the camera 20 may detect, for example, that the camera is in a dark environment and provide the data to the processor 62. Alternatively, the processor 62 may detect that the media input item received from the camera 20 is completely black. Based on the data indicating a dark environment and/or the image data, the processor 62 may choose to exclude the media input item from the camera 20 from further processing.

Other conditions, such as excessive shaking of the camera 20, high ambient light conditions, etc., may also be used to exclude images from further processing.
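
A minimal sketch of such rule-based exclusion follows; the thresholds and sensor field names are assumptions for illustration, not values taken from this disclosure:

    DARK_LUMA = 0.05        # mean luma below this suggests a covered lens
    MAX_PITCH_DOWN = -60.0  # degrees; camera aimed at the ground
    MAX_SHAKE = 0.8         # normalized shake amplitude from accelerometer

    def exclude_from_processing(sensors, frame_stats):
        """Return True when a media input item should be skipped."""
        if frame_stats["mean_luma"] < DARK_LUMA:
            return True   # essentially black frames (camera in a pocket)
        if sensors["pitch_deg"] < MAX_PITCH_DOWN:
            return True   # pointing toward the ground
        if sensors["shake"] > MAX_SHAKE:
            return True   # excessive shaking
        return False

    print(exclude_from_processing({"pitch_deg": -80.0, "shake": 0.1},
                                  {"mean_luma": 0.4}))   # True: aimed down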

Selecting a Media Input Item for Inclusion in the Media Output Item Based on the Rules and Type of Playing Field Related to an Event

The media studio 12 processor 62 may compose a media output item from one or more media input items based on the rules and type of playing field related to an event. Based on the rules, the processor 62 may be able to predict where the next action may take place. The processor 62 may further know, based on user input, the location of one or more media devices 16 relative to a playing field, and the areas of the playing field being recorded by each of the media devices 16. The processor 62 may select to display the media input item from the area of the field where the next activity is predicted.

For example, and referring to FIG. 5, the event being recorded may be a tennis match which is being played on a tennis court 70. First, second, and third cameras 20a, 20b, 20c may be used to record the tennis match and may respectively be arranged to record first, second, and third ranges 72, 74, 76 of the court 70 and surrounding area.

The processor 62 may determine, for example, based on the rules, that a first player, serving from the range 72, is about to serve. The processor 62 may select a media input item being generated by the first camera 20a for display in the media output item.

The processor 62 may then, for example, monitor the media input item received from the first camera 20a and determine that the first player has executed the serve. The processor 62 may determine that the ball is now in play and select a media input item received from the third camera 20c, showing the entire court 70, for display in the media output item.

Upon detecting that play has ended for a particular serve, the processor 62 may, e.g., switch back to the first camera 20a, in order to display the next serve from the first player.

The processor 62 may further, for example, based on user input and/or analysis of image data, keep track of a score for a game and set. When, for example, according to the example above, the game is over, the processor 62 may determine that a second player will take over the serve, and the processor 62 may then select a media input item received from the second camera 20b for display during serves.
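
The serve/rally switching described above can be illustrated as a small state machine; the states, events, and camera mapping below are a hypothetical sketch, not a prescribed implementation:

    CAMERA_FOR_STATE = {
        "serve_player1": "20a",   # covers range 72
        "serve_player2": "20b",   # covers range 74
        "rally": "20c",           # covers the whole court 70, range 76
    }

    def next_state(state, event, server):
        """Advance the production state from detected match events; server
        is 1 or 2, whichever player serves next."""
        if state.startswith("serve") and event == "serve_executed":
            return "rally"
        if state == "rally" and event == "point_over":
            return f"serve_player{server}"
        return state

    state = "serve_player1"
    for event in ["serve_executed", "point_over", "serve_executed"]:
        state = next_state(state, event, server=1)
        print(event, "->", state, "camera", CAMERA_FOR_STATE[state])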

Recommending Camera Setup for Recording an Event

The media studio 12 processor 62 may, for a particular type of event, recommend a preferred setup for cameras 20 capturing the event. The recommendations may be made, e.g., based on the number of cameras 20 available, the shape of the field, knowledge of the event to be captured, etc.

For example, a user of the media studio 12 may indicate, via the user interface 64, that the user would like to record a tennis match. The user may indicate that three cameras 20 are available for recording the match. Based on the input, the processor 62 may display, e.g., via the user interface 64, a diagram similar to the diagram of FIG. 5. The diagram may recommend locations for each of the first, second, and third cameras 20a, 20b, 20c, and the respective first, second, and third ranges 72, 74, 76 recommended to be covered by each of the cameras 20a, 20b, 20c.

The processor 62 may use additional types of data, such as orientation of the playing field, location of the sun relative to the playing field during a game, location of players' benches, etc., to recommend locations for cameras 20 during a recording event. For example, as in the example above, the user may indicate that the user would like to record a tennis match. An example playing field 80 is shown in FIG. 6A. The playing field 80 may be an outdoor playing field 80 for which global positioning data is available. The user may establish coordinates of the playing field 80 relative to, e.g., a north/south axis 84 and an east/west axis 85, and define a coordinate of 0,0 for a westmost corner 81 of the playing field 80.

As shown in FIG. 6B, based on, e.g., the global positioning data, a time and date of the game, etc., the processor 62 may determine a position of the sun 81 relative to the playing field 80 during the anticipated game time. The processor 62 may further consider, e.g., weather conditions, such as a predicted level of cloudiness during the anticipated game time. The processor 62 may still further consider, e.g., characteristics of a camera 20, such as focal length, zoom capability, etc.

Based on the orientation of the playing field 80, the position of the sun, weather conditions, characteristics of one or more available cameras 20, etc., the processor 62 of the media studio 12 may determine preferred and nonpreferred ranges for the one or more cameras 20 relative to the playing field 80. For example, referring to FIG. 6C, a preferred area for operating a first camera 20d may be in range 86. Range 86 may allow the first camera 20d, based on the focal length, zoom, etc., to capture the entire playing field without facing the sun 81.

The processor 62 may further identify a nonpreferred range for operating the first camera 20d. For example, the range 87 may be nonpreferred, as recording the game would require recording while facing the sun 81.

Additional information may be considered for placement of additional cameras 20. For example, a second camera 20e and a third camera 20f may be available to record the game. The user may input to the processor 62, e.g., the location of the players' benches 88, 89. For example, as shown in FIG. 6D, each of the second and third cameras 20e, 20f may be placed on an opposite side of the field from the benches 88, 89. The second camera 20e may be placed opposite the bench 88, and the third camera 20f may be placed opposite the bench 89.

In some cases, an absolute field location, for example, based on a global positioning system (GPS), may be used. For example, a user may indicate, via the user interface 64, that the user would like to record a sports event such as a football game. The user may further input, via the user interface 64, one or more coordinates related to a football field where the game will be played, e.g., the location of one or more of the corners of the field. Based on the location of the corner and the known dimensions of the football field, the processor 62 may determine recommended GPS coordinates for the location of one or more cameras 20 to be used for recording the football game.
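
As an illustrative sketch only, the following derives one recommended camera coordinate from a single surveyed corner; the flat-earth offset math, the field dimensions, and the midfield-sideline placement rule are simplifying assumptions rather than anything prescribed here:

    import math

    FIELD_LENGTH_M = 109.7      # American football field incl. end zones
    SIDELINE_SETBACK_M = 10.0   # distance behind one sideline, assumed

    def offset_latlon(lat, lon, north_m, east_m):
        """Shift a WGS-84 coordinate by small metric north/east offsets."""
        dlat = north_m / 111_320.0
        dlon = east_m / (111_320.0 * math.cos(math.radians(lat)))
        return lat + dlat, lon + dlon

    def recommend_camera(corner_lat, corner_lon, bearing_deg):
        """Place one camera at midfield, set back from one sideline.
        bearing_deg: compass bearing of the field's long axis at the corner;
        the setback is applied 90 degrees clockwise from that axis."""
        b = math.radians(bearing_deg)
        north = (FIELD_LENGTH_M / 2) * math.cos(b) - SIDELINE_SETBACK_M * math.sin(b)
        east = (FIELD_LENGTH_M / 2) * math.sin(b) + SIDELINE_SETBACK_M * math.cos(b)
        return offset_latlon(corner_lat, corner_lon, north, east)

    print(recommend_camera(40.4432, -79.9565, bearing_deg=90.0))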

In some cases, fields may be located in an area without access to GPS data, such as indoors. In such cases, laser measuring devices may be used to calculate distances from cameras to the field and calculate optimal camera operator locations.

Tracking Location and Motion of Players and Objects

The media studio 12 processor 62 may be programmed to track a particular participant in an event, or to track a particular object. For example, parents may wish for a recording of a sports event to focus on the participation of their child. As another example, the processor 62 may be programmed, for example, to track a basketball being used in a basketball game, such that action near the ball is being captured.

The tracking may be performed via data collectors 17. For example, the player or object to be tracked may be equipped with an RFID or other type of tag. Data collectors 17 may detect the location of the tag relative to the field of play. As discussed above, the processor 62 may have received coordinates of the field of play and a location and range of cameras 20 recording the event. Based on the location of the tag relative to the field and the location and range of the one or more cameras 20, the processor 62 may select a camera 20 and related media input item for inclusion in the media output item.
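
A sketch of such tag-based camera selection appears below; the circular coverage zones and the camera coordinates are illustrative assumptions:

    import math

    CAMERAS = {                    # camera -> (center_x, center_y, radius), meters
        "20a": (0.0, 5.0, 15.0),
        "20b": (40.0, 5.0, 15.0),
        "20c": (20.0, -8.0, 40.0),   # wide view covering the whole field
    }

    def camera_for_tag(x, y):
        """Pick the covering camera with the smallest zone (tightest view)."""
        covering = [
            (radius, cam)
            for cam, (cx, cy, radius) in CAMERAS.items()
            if math.hypot(x - cx, y - cy) <= radius
        ]
        return min(covering)[1] if covering else None

    print(camera_for_tag(2.0, 6.0))   # -> '20a'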

Additionally or alternatively, the location of the player or object may be determined using known image recognition techniques. The processor 62 may be taught to identify the player or object to be tracked and display a media input item including the player or object.

Based on tracking motion of the player or object, the processor 62 may further be programmed to anticipate where the player or object is going. For example, when tracking a basketball in a basketball game, the processor 62 may detect when the ball is moving from one side of the basketball court toward the other side and predict when the basketball will appear in the range of a camera 20. The processor 62 may select the media input item to be displayed based on where the processor 62 predicts that the ball will be.

Based on the motion and location data, the processor 62 may further be programmed to determine levels of motion within different locations in the field of play. For example, based on a predetermined rule, an overall level of motion for each location may be determined. Further, based on the determined overall levels of motion, the processor 62 may determine, e.g., one or more locations with an overall motion level above a predetermined level or a location with a highest relative overall motion level.

For example, the processor 62 may receive motion data from one or more players in a game. The processor 62 may be programmed, e.g., to sum all of the motion of all of the players appearing respectively in each of several locations in the playing field for a particular period of time. Motion may be defined as the total displacement for each of the players during the period of time. The overall level of motion for each location may be determined as the sum of the motion of all of the players in that location, and may be compared to a predetermined threshold (for example, a total number of meters of displacement during the time period within the location). Media input items (from cameras 20) including the locations with overall motion levels above the predetermined threshold may be included in a media output item.

Additionally or alternatively, a location with a highest relative overall level of motion may be determined. An input item including the location with the highest relative overall motion may be included in the media output item.
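
The per-location motion rule might be sketched as follows, with the threshold value and location names as illustrative assumptions:

    MOTION_THRESHOLD_M = 50.0   # total displacement per window, assumed value

    def overall_motion(displacements_by_location):
        """displacements_by_location: {location: [meters moved, per player]}"""
        return {loc: sum(d) for loc, d in displacements_by_location.items()}

    def locations_to_show(displacements_by_location):
        levels = overall_motion(displacements_by_location)
        hot = [loc for loc, level in levels.items() if level > MOTION_THRESHOLD_M]
        return hot or [max(levels, key=levels.get)]   # fall back to the busiest

    print(locations_to_show({"left_half": [12.0, 9.5],
                             "right_half": [30.0, 26.0]}))   # ['right_half']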

Accelerated Playback after Pause (Catch-Up Feature)

During the viewing of an event, for example, a sports event, a viewer may choose to pause the display for a period of time. When the viewer is ready to resume watching the event, the viewer may wish to be brought up-to-date on exciting moments that occurred during the pause. A computing device, which may be associated with a viewer 22, another media device 16, the media studio 12, the server 28, etc., may be programmed to identify and display the exciting moments, while skipping over less exciting periods of the event.

The computing device may identify segments of the event and develop one or more excitement ratings for each segment. As described above with regard to excitement levels, the computing device may receive data indicating an excitement level for each segment. A nonlimiting list of data that may be considered in determining the excitement level includes biometric data associated with one or more camera operators; sound level data indicating when the audience is cheering or screaming; image data indicating quick movement or, e.g., that a score has occurred in the game; excited facial expressions in audience image data; quick movement of a camera 20 indicating that the camera 20 is following action in the game; input from a camera operator (e.g., depressing a highlights button) during exciting moments; an increase in social media activity related to the event; etc. Based on the data, one or more rules may be constructed to determine a level of excitement associated with each segment of the event.

For example, segments including a score, as determined, e.g., from image data, may be awarded a maximum level of 100. An excitement level for other periods may be determined based on a combination of a level of sound and biometric data from the camera operator. Each segment may, according to this example, be assigned an excitement level between 0 and 100.

Following the pause, when the viewer activates the catch-up feature, the computing device may display, e.g., chronologically, the segments with an excitement level above a threshold. In order to provide context, the computing device may display segments including a first predetermined period of time prior to, and/or segments including a second predetermined period of time after, the segments that are above the threshold. The first and second predetermined time periods may be, for example, 10 seconds.

The threshold may be a default threshold, for example, an excitement level of 70 or higher. Alternatively, the viewer may specify an excitement level threshold for segments to be watched during the catch-up mode. As another alternative, the viewer may, e.g., specify how long the catch-up period should last. Based on a length of a catch-up period provided by the viewer, the computing device may calculate the excitement level threshold that generates the appropriate amount of playback material. The computing device may then display, chronologically, all of the segments with an excitement level above the calculated threshold.
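
One way to derive such a threshold from a viewer-specified catch-up length is sketched below; the segment tuples and the fixed 10-second context padding follow the example above, while the descending search strategy is an assumption:

    CONTEXT_S = 10.0   # pre/post context added around each qualifying segment

    def catchup_segments(segments, max_seconds):
        """segments: chronological list of (start_s, end_s, excitement 0-100).
        Return the qualifying segments for the lowest threshold that still
        fits within max_seconds of playback, padding included."""
        best = []
        for threshold in range(100, -1, -1):
            picked = [seg for seg in segments if seg[2] >= threshold]
            total = sum((end - start) + 2 * CONTEXT_S
                        for start, end, _ in picked)
            if total > max_seconds:
                break
            best = picked
        return best

    segs = [(0, 30, 40), (60, 75, 95), (120, 150, 80), (200, 210, 100)]
    print(catchup_segments(segs, max_seconds=90))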

Creating a Highlights Media Item

The media studio 12 processor 62 may be programmed to, during the recording of an event, generate a highlights media output item. The highlights media output item may include, e.g., only exciting segments during the event. The highlights media output item may be, e.g., stored in a storage device 23 or transmitted to the server 28 for later use.

As described above with regard to the catch-up feature, a media item (a recording of an event) may be divided into one or more segments, each segment representing a period of time. An excitement level may be assigned to each segment. Segments with an excitement level above a predetermined threshold may be included in the highlights media output item. As another example, the processor 62 may be programmed to generate a highlights media item to capture scoring drives/plays. Based on analysis of the game being recorded, the media studio 12 processor 62 may identify scoring drives and/or plays. The processor 62 can create a highlights media item which includes the scoring drive/play. The highlights media item may include, e.g., a predetermined period before and after the scoring drive/play. The additional content before and after the scoring drive/play may provide context to a user, allowing the user to differentiate each highlight segment. A highlights media item may be based on an individual game, multiple games happening on a same day, multiple games played by a particular team during a season, etc.
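
A minimal sketch of assembling highlight windows with context padding, merging clips that overlap, follows; the threshold and padding values echo the examples above but are otherwise assumptions:

    PRE_S, POST_S = 10.0, 10.0   # context before/after each qualifying segment
    THRESHOLD = 70               # excitement cutoff, per the earlier example

    def highlight_windows(segments):
        """segments: chronological list of (start_s, end_s, excitement)."""
        windows = [
            (max(0.0, start - PRE_S), end + POST_S)
            for start, end, level in segments if level >= THRESHOLD
        ]
        merged = []
        for start, end in windows:
            if merged and start <= merged[-1][1]:   # overlaps previous clip
                merged[-1] = (merged[-1][0], max(merged[-1][1], end))
            else:
                merged.append((start, end))
        return merged

    print(highlight_windows([(0, 20, 50), (40, 55, 90), (60, 70, 85)]))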

Note that, in addition to excitement level, other factors may be used in creating a highlights media output item. As an example, the highlights media output item could feature views that include a particular player.

Generating Replays for Peak Moments of an Event

The media studio 12 processor 62 may be programmed, during the recording of an event, to identify peak moments that occur during the event. Peak moments may be time periods during the event when an excitement level is above a certain predetermined threshold. The excitement level may be determined as described above.
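
Because peak moments must be identified while the event is still being recorded, a streaming detector is a natural fit. The sketch below flags spans where a periodic excitement reading rises above the threshold; the lower release level is an illustrative guard against re-triggering on noise, not something the disclosure requires.

    def detect_peaks(readings, threshold=85, release=75):
        # Yield (start_index, end_index) spans where the excitement level
        # rises to `threshold` and stays elevated until it drops below
        # `release` (hysteresis, so small dips do not end the peak).
        in_peak, start = False, 0
        for i, level in enumerate(readings):
            if not in_peak and level >= threshold:
                in_peak, start = True, i
            elif in_peak and level < release:
                in_peak = False
                yield (start, i)
        if in_peak:
            yield (start, len(readings))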

When a peak moment is identified, the media studio 12 may include one or more replays of the moment in the media output item. The replays may include different views of the peak moment, taken, for example, from different media input items received by the processor 62.

The media input items to be included in the replays may be selected, e.g., according to the location of the camera 20 relative to the action of interest, according to the amount of movement in the respective media input item, based on the presence of a player in the particular media input item, etc.
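
The sketch below ranks candidate media input items for a replay using those three cues. The CameraFeed fields and the scoring weights are hypothetical, and deriving them (camera location, motion estimation, player detection) is outside the scope of the sketch.

    from dataclasses import dataclass

    @dataclass
    class CameraFeed:
        camera_id: int
        distance_to_action_m: float  # camera 20 distance from the action of interest
        motion_score: float          # amount of movement in the media input item, 0.0-1.0
        player_visible: bool         # the player of interest appears in this input

    def rank_replay_feeds(feeds, max_views=3):
        # Nearer cameras, more movement, and a visible player of interest
        # all raise a feed's score; return the best feeds for the replays.
        def score(f):
            proximity = 1.0 / (1.0 + f.distance_to_action_m)
            return 0.5 * proximity + 0.3 * f.motion_score + (0.2 if f.player_visible else 0.0)
        return sorted(feeds, key=score, reverse=True)[:max_views]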

CONCLUSION

As used herein, the adverb “substantially” means that a shape, structure, measurement, quantity, time, etc. may deviate from an exact described geometry, distance, measurement, quantity, time, etc., because of imperfections in materials, machining, manufacturing, etc.

The term “exemplary” is used herein in the sense of signifying an example, e.g., a reference to an “exemplary widget” should be read as simply referring to an example of a widget.

Networked devices such as those discussed herein generally each include instructions executable by one or more networked devices such as those identified above, and for carrying out blocks or steps of processes described above. For example, process blocks discussed above may be embodied as computer-executable instructions.

Computer-executable instructions may be compiled or interpreted from computer programs created using a variety of programming languages and/or technologies, including, without limitation, and either alone or in combination, Java™, C, C++, Visual Basic, JavaScript, Perl, HTML, etc. In general, a processor (e.g., a microprocessor) receives instructions, e.g., from a memory, a computer-readable medium, etc., and executes these instructions, thereby performing one or more processes, including one or more of the processes described herein. Such instructions and other data may be stored and transmitted using a variety of computer-readable media. A file in a networked device is generally a collection of data stored on a computer-readable medium, such as a storage medium, a random access memory, etc.

A computer-readable medium includes any medium that participates in providing data (e.g., instructions), which may be read by a computer. Such a medium may take many forms, including, but not limited to, non-volatile media, volatile media, etc. Non-volatile media include, for example, optical or magnetic disks and other persistent memory. Volatile media include dynamic random access memory (DRAM), which typically constitutes a main memory. Common forms of computer-readable media include, for example, a floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, DVD, any other optical medium, punch cards, paper tape, any other physical medium with patterns of holes, a RAM, a PROM, an EPROM, a FLASH-EEPROM, any other memory chip or cartridge, or any other medium from which a computer can read.

In the drawings, the same reference numbers indicate the same elements. Further, some or all of these elements could be changed. With regard to the media, processes, systems, methods, etc. described herein, it should be understood that, although the steps of such processes, etc. have been described as occurring according to a certain ordered sequence, such processes could be practiced with the described steps performed in an order other than the order described herein. It further should be understood that certain steps could be performed simultaneously, that other steps could be added, or that certain steps described herein could be omitted. In other words, the descriptions of processes herein are provided for the purpose of illustrating certain embodiments, and should in no way be construed so as to limit the claimed invention.

Accordingly, it is to be understood that the above description is intended to be illustrative and not restrictive. Many embodiments and applications other than the examples provided would be apparent to those of skill in the art upon reading the above description. The scope of the invention should be determined, not with reference to the above description, but should instead be determined with reference to the appended claims, along with the full scope of equivalents to which such claims are entitled. It is anticipated and intended that future developments will occur in the arts discussed herein, and that the disclosed systems and methods will be incorporated into such future embodiments. In sum, it should be understood that the invention is capable of modification and variation and is limited only by the following claims.

All terms used in the claims are intended to be given their plain and ordinary meanings as understood by those skilled in the art unless an explicit indication to the contrary is made herein. In particular, use of the singular articles such as “a,” “the,” “said,” etc. should be read to recite one or more of the indicated elements unless a claim recites an explicit limitation to the contrary.

1. (canceled)
 2. A method for generating a media output item from a plurality of media input items, the method comprising: receiving a plurality of media input items each from a respective one of a plurality of media devices, each media device having a respective one or more sensors to capture an excitement level of a respective user of the media device, such that each media input item comprises a respective excitement measurement indicating the excitement level of the respective user of the respective media device during capture of the media input item; generating an aggregated media item automatically by combining two or more of the media input items to be displayed simultaneously within the aggregated media item; displaying the aggregated media item in an operator console; generating a media output item by selecting a particular one of the plurality of media input items for display in the media output item based on the respective excitement measurement for the particular one of the plurality of media input items; and outputting the media output item.
 3. The method of claim 2, wherein: the generating the aggregated media item comprises time-synchronizing the two or more of the media input items.
 4. The method of claim 2, wherein: each media device is associated with a respective overall correction coefficient; and selecting the particular one of the plurality of media input items comprises normalizing the respective excitement measurements based on the respective overall correction coefficients.
 5. The method of claim 4, wherein, for each media device: each of the one or more sensors of the media device is associated with a respective specific correction coefficient; and the respective overall correction coefficient for the media device is based on the respective specific correction coefficients of the one or more sensors.
 6. The method of claim 2, further comprising: receiving one or more commands from an operator via a graphical user interface of the operator console responsive to the displaying, wherein the generating the media output item is further based on the one or more commands.
 7. The method of claim 6, further comprising: displaying a graphical indication via the graphical user interface to recommend to the operator the particular one of the plurality of media input items, wherein the one or more commands indicate a selection by the operator of the particular one of the plurality of media input items responsive to the graphical indication.
 8. The method of claim 2, further comprising: transmitting the aggregated media item via a network to the operator console prior to the displaying, wherein the receiving, the generating the aggregated media item, and the transmitting are performed by a first computational system, and the operator console is implemented on a second computational system communicatively coupled with the first computational system over the network.
 9. The method of claim 8, further comprising: compressing the aggregated media item prior to the transmitting via the network.
 10. The method of claim 2, wherein: the generating the media output item further comprises performing video and/or audio processing operations on the particular one of the plurality of media input items.
 11. The method of claim 2, wherein each respective excitement measurement is derived from at least one biometric measurement of the respective user.
 12. A system comprising a media studio including one or more processors and one or more memories, the one or more memories storing instructions executable by the one or more processors such that the one or more processors are programmed to: receive a plurality of media input items each from a respective one of a plurality of media devices, each media device having a respective one or more sensors to capture an excitement level of a respective user of the media device, such that each media input item comprises a respective excitement measurement indicating the excitement level of the respective user of the respective media device during capture of the media input item; generate an aggregated media item automatically by combining two or more of the media input items to be displayed simultaneously within the aggregated media item; display the aggregated media item in an operator console; and generate a media output item by selecting a particular one of the plurality of media input items for display in the media output item based on the respective excitement measurement for the particular one of the plurality of media input items.
 13. The system of claim 12, wherein the one or more processors are further programmed to output the media output item.
 14. The system of claim 12, wherein the one or more processors are programmed to generate the aggregated media item by time-synchronizing the two or more of the media input items.
 15. The system of claim 12, wherein: each media device is associated with a respective overall correction coefficient; and the one or more processors are programmed to select the particular one of the plurality of media input items by normalizing the respective excitement measurements based on the respective overall correction coefficients.
 16. The system of claim 15, wherein, for each media device: each of the one or more sensors of the media device is associated with a respective specific correction coefficient; and the respective overall correction coefficient for the media device is based on the respective specific correction coefficients of the one or more sensors.
 17. The system of claim 12, wherein the one or more processors are programmed further to: receive one or more commands from an operator via a graphical user interface of the operator console responsive to the displaying, wherein the one or more processors are programmed to generate the media output item further based on the one or more commands.
 18. The system of claim 17, wherein the one or more processors are programmed further to: display a graphical indication via the graphical user interface to recommend to the operator the particular one of the plurality of media input items, wherein the one or more commands indicate a selection by the operator of the particular one of the plurality of media input items responsive to the graphical indication.
 19. The system of claim 12, wherein: the one or more memories comprise a first memory and a second memory; the one or more processors comprise at least: a first processor disposed in a first computational system and coupled with the first memory, and a second processor disposed in a second computational system and coupled with the second memory; the first memory has, stored thereon, a first portion of the instructions executable by the one or more processors, such that the first processor is programmed to receive the plurality of media input items and to generate the aggregated media item, and such that the first processor is further programmed to transmit the aggregated media item via a network to the second processor; and the second memory has, stored thereon, a second portion of the instructions executable by the one or more processors, such that the second processor is programmed to display the aggregated media item responsive to receiving the aggregated media item from the first processor via the network.
 20. The system of claim 12, wherein the one or more processors are programmed to generate the media output item further by performing video and/or audio processing operations on the particular one of the plurality of media input items.
 21. The system of claim 12, wherein each respective excitement measurement is derived from at least one biometric measurement of the respective user.